CDS

Accession Number TCMCG021C21388
gbkey CDS
Protein Id XP_010933717.1
Location join(21414734..21414829,21415251..21415365,21415465..21415544,21419049..21419124,21419210..21419274,21419529..21419591,21421574..21421655,21421752..21421914,21425045..21425732)
Gene LOC105054037
GeneID 105054037
Organism Elaeis guineensis

Protein

Length 475aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_010935415.3
Definition U1 small nuclear ribonucleoprotein 70 kDa isoform X2 [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category A
Description U1 small nuclear ribonucleoprotein
KEGG_TC -
KEGG_Module M00351        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko03041        [VIEW IN KEGG]
KEGG_ko ko:K11093        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03040        [VIEW IN KEGG]
map03040        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGGAGACTACAACGATGCCATGATGCGCAACAACGCCGCTGTCCAGGCCCGCACGAAAGCCCAGAACCGAGCTAACGTCCTTCAGCTAAAACTGATTGGGCAGAGTCATCCAACTGGCCTTACCACCAATCTTTTGAAGCTATTTGAACCCCGGCCTCCTTTGGAGTATAAGCCTCCTATCGACAAGAGGAAATGCCCTCCATATACTGGCATGGCACAGTTGGTGAGTCATTTTGCCGAGCCTGGGGATCCTGAATATGCTCCGCCCGTCGAAAAGGGTGAAACTATGGCACAGAAAAGAGCTAGAATCCACACGCTTCGGCTGGAGAAAGGTGCAATAAAAGCTGCTGAAGAACTTGAGAAATATGATCCAAATAAAGACCCTAATATAACTGGGGATCCATACAAGACATTGTTTGTGGCAAAGCTTAACTATGAGACTACTGAGCATAGGATCAAAAGGGAGTTTGAAGCTTATGGGCCAATCAAACGGGTCCGGCTGATTACCGACAAGGTGACAAATAAACCTAGAGGATATGCCTTCATCGAGTACATGCATACTCGGGATATGAAAACTGCTTACAAGCAAGCTGATGGGAGGAAAGTGGATGGTAAGCGGGTACTTGTGGATGTTGAGCGTGGTAGAACTGTTCCAAATTGGCGACCTCGAAGATTGGGTGGTGGACTTGGATCAACCAGGATTGGAGGTGAAGAGGTTAATCAGAAGTATTCTGGCAGGGACAAGGGAAAATCTCGAGAACGAGGGAGGGAAAGGGAACGAGACCAGGAGAGGTCACATGAAAGGTCCCATGACAAGGCACGAGATCGTGATACAAGAGAAGATAGGCACCACCACAGAGACCGAGATAGGAATAGGGACAGAGACAGGGAAAGAGACCGTGGGCGAGACCGTGATCGAGCTCGTGACAGAGACAGGGAGAGAGACCGTGGCCGTGACTATGATAGAGATCGGGAACGTGATCGTGACCGTCCTCGGGAGAGGGAACGTGACAGAGATTATGACCATGCAAGCCATGAAAGAAACCGTGGGCAGTCACATGACAGGGATACTGACTATGATTATATGGAACCAAAGCATGATGGGGAGCTGCCTGAAGTGAAGGCAAGAGACTTTGATCATGGAGAACCAAATCATGGACAAGAGTGGTATGATGGGCCTAAGCATGGGAATGAACATGATTATCCATTTGAGCAACAGAGAAATCAGGAACAATATGATTTCCAACTCCATGGCCTCGGTGATCCTCAGAATGATTCAGAGCGCTCTAGGCGCCAAGACCATGAATACCATAACCATGTGCCTTATGATAAGGTGGATCATGTCAATTATCATGGTCAATTTAACCATGTTGAATCTGAATCACGTGAGGAGGGTGAGGCATTTGGTGACCAGGACTATGAGTAG
Protein:  
MGDYNDAMMRNNAAVQARTKAQNRANVLQLKLIGQSHPTGLTTNLLKLFEPRPPLEYKPPIDKRKCPPYTGMAQLVSHFAEPGDPEYAPPVEKGETMAQKRARIHTLRLEKGAIKAAEELEKYDPNKDPNITGDPYKTLFVAKLNYETTEHRIKREFEAYGPIKRVRLITDKVTNKPRGYAFIEYMHTRDMKTAYKQADGRKVDGKRVLVDVERGRTVPNWRPRRLGGGLGSTRIGGEEVNQKYSGRDKGKSRERGRERERDQERSHERSHDKARDRDTREDRHHHRDRDRNRDRDRERDRGRDRDRARDRDRERDRGRDYDRDRERDRDRPRERERDRDYDHASHERNRGQSHDRDTDYDYMEPKHDGELPEVKARDFDHGEPNHGQEWYDGPKHGNEHDYPFEQQRNQEQYDFQLHGLGDPQNDSERSRRQDHEYHNHVPYDKVDHVNYHGQFNHVESESREEGEAFGDQDYE